Functional Elements and POS Categories

نویسندگان

  • Qiuye Zhao
  • Mitch Marcus
چکیده

We propose a bootstrapping algorithm which successfully resolves two fundamental tasks: morphology acquisition and the acquisition of a subset of functional words. Given the outputs of these fundamental tasks, we build a nearly state-of-art morphology analyzer performing with a F1-score of 80.94%; also, we can improve the baseline model for acquiring functional words by an absolute error reduction of 26%. Furthermore, with these acquisition outputs, a minimally supervised tagging system proposed before can be turned into a totally unsupervised one, achieving a tagging accuracy of 85.26% for openclass words.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A radical extension of the category of $S$-sets

Let S-Set be the category of $S$-sets, sets together with the actions of a semigroup $S$ on them. And, let S-Pos be the category of $S$-posets, posets together with the actions compatible with the orders on them. In this paper we show that the category S-Pos is a radical extension of S-Set; that is there is a radical on the category S-Pos, the order desolator radical, whose torsion-free class i...

متن کامل

Morita theorems for partially ordered monoids

Two partially ordered monoids S and T are called Morita equivalent if the categories of right S-posets and right T -posets are Pos-equivalent as categories enriched over the category Pos of posets. We give a description of Pos-prodense biposets and prove Morita theorems I, II, and III for partially ordered monoids.

متن کامل

The production of lexical categories (VP) and functional categories (copula) at the initial stage of child L2 acquisition

This is a longitudinal case study of two Farsi-speaking children learning English: ‘Bernard’ and ‘Melissa’, who were 7;4 and 8;4 at the start of data collection. The research deals with the initial state and further development in the child second language (L2) acquisition of syntax regarding the presence or absence of copula as a functional category, as well as the role and degree of L1 influe...

متن کامل

Probabilistic Models of Short and Long Distance Word Dependencies in Running Text

This article describes two complementary models that represent dependencies between words in loca/ and non-local contexts. The type of local dependencies considered are sequences of part of speech categories for words. The non-local context of word dependency considered here is that of word recurrence, which is typical in a text. Both are models of phenomena that are to a reasonable extent doma...

متن کامل

The Role of Parts-of-Speech in Feature Selection

This research explores the role of parts-of-speech (POS) in feature selection in text categorization. We compare the use of different POS, namely nouns, verbs, adjectives and adverbs with a feature set that contains all POS. The best results are obtained with the use of only nouns. Therefore, we make use of a WordNet-based POS feature selection approach using the nouns feature set to compare wi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011